Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 76
Filter
1.
NPJ Digit Med ; 7(1): 98, 2024 Apr 18.
Article in English | MEDLINE | ID: mdl-38637674

ABSTRACT

Accurate prediction of recurrence and progression in non-muscle invasive bladder cancer (NMIBC) is essential to inform management and eligibility for clinical trials. Despite substantial interest in developing artificial intelligence (AI) applications in NMIBC, their clinical readiness remains unclear. This systematic review aimed to critically appraise AI studies predicting NMIBC outcomes, and to identify common methodological and reporting pitfalls. MEDLINE, EMBASE, Web of Science, and Scopus were searched from inception to February 5th, 2024 for AI studies predicting NMIBC recurrence or progression. APPRAISE-AI was used to assess methodological and reporting quality of these studies. Performance between AI and non-AI approaches included within these studies were compared. A total of 15 studies (five on recurrence, four on progression, and six on both) were included. All studies were retrospective, with a median follow-up of 71 months (IQR 32-93) and median cohort size of 125 (IQR 93-309). Most studies were low quality, with only one classified as high quality. While AI models generally outperformed non-AI approaches with respect to accuracy, c-index, sensitivity, and specificity, this margin of benefit varied with study quality (median absolute performance difference was 10 for low, 22 for moderate, and 4 for high quality studies). Common pitfalls included dataset limitations, heterogeneous outcome definitions, methodological flaws, suboptimal model evaluation, and reproducibility issues. Recommendations to address these challenges are proposed. These findings emphasise the need for collaborative efforts between urological and AI communities paired with rigorous methodologies to develop higher quality models, enabling AI to reach its potential in enhancing NMIBC care.

2.
BMJ Open ; 14(4): e074604, 2024 Apr 12.
Article in English | MEDLINE | ID: mdl-38609314

ABSTRACT

RATIONALE: Intensive care units (ICUs) admit the most severely ill patients. Once these patients are discharged from the ICU to a step-down ward, they continue to have their vital signs monitored by nursing staff, with Early Warning Score (EWS) systems being used to identify those at risk of deterioration. OBJECTIVES: We report the development and validation of an enhanced continuous scoring system for predicting adverse events, which combines vital signs measured routinely on acute care wards (as used by most EWS systems) with a risk score of a future adverse event calculated on discharge from the ICU. DESIGN: A modified Delphi process identified candidate variables commonly available in electronic records as the basis for a 'static' score of the patient's condition immediately after discharge from the ICU. L1-regularised logistic regression was used to estimate the in-hospital risk of future adverse event. We then constructed a model of physiological normality using vital sign data from the day of hospital discharge. This is combined with the static score and used continuously to quantify and update the patient's risk of deterioration throughout their hospital stay. SETTING: Data from two National Health Service Foundation Trusts (UK) were used to develop and (externally) validate the model. PARTICIPANTS: A total of 12 394 vital sign measurements were acquired from 273 patients after ICU discharge for the development set, and 4831 from 136 patients in the validation cohort. RESULTS: Outcome validation of our model yielded an area under the receiver operating characteristic curve of 0.724 for predicting ICU readmission or in-hospital death within 24 hours. It showed an improved performance with respect to other competitive risk scoring systems, including the National EWS (0.653). CONCLUSIONS: We showed that a scoring system incorporating data from a patient's stay in the ICU has better performance than commonly used EWS systems based on vital signs alone. TRIAL REGISTRATION NUMBER: ISRCTN32008295.


Subject(s)
Patient Readmission , State Medicine , Humans , Hospital Mortality , Intensive Care Units , Critical Care
3.
J Med Internet Res ; 26: e52880, 2024 Jan 18.
Article in English | MEDLINE | ID: mdl-38236623

ABSTRACT

BACKGROUND: Surgical site infections (SSIs) occur frequently and impact patients and health care systems. Remote surveillance of surgical wounds is currently limited by the need for manual assessment by clinicians. Machine learning (ML)-based methods have recently been used to address various aspects of the postoperative wound healing process and may be used to improve the scalability and cost-effectiveness of remote surgical wound assessment. OBJECTIVE: The objective of this review was to provide an overview of the ML methods that have been used to identify surgical wound infections from images. METHODS: We conducted a scoping review of ML approaches for visual detection of SSIs following the JBI (Joanna Briggs Institute) methodology. Reports of participants in any postoperative context focusing on identification of surgical wound infections were included. Studies that did not address SSI identification, surgical wounds, or did not use image or video data were excluded. We searched MEDLINE, Embase, CINAHL, CENTRAL, Web of Science Core Collection, IEEE Xplore, Compendex, and arXiv for relevant studies in November 2022. The records retrieved were double screened for eligibility. A data extraction tool was used to chart the relevant data, which was described narratively and presented using tables. Employment of TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) guidelines was evaluated and PROBAST (Prediction Model Risk of Bias Assessment Tool) was used to assess risk of bias (RoB). RESULTS: In total, 10 of the 715 unique records screened met the eligibility criteria. In these studies, the clinical contexts and surgical procedures were diverse. All papers developed diagnostic models, though none performed external validation. Both traditional ML and deep learning methods were used to identify SSIs from mostly color images, and the volume of images used ranged from under 50 to thousands. Further, 10 TRIPOD items were reported in at least 4 studies, though 15 items were reported in fewer than 4 studies. PROBAST assessment led to 9 studies being identified as having an overall high RoB, with 1 study having overall unclear RoB. CONCLUSIONS: Research on the image-based identification of surgical wound infections using ML remains novel, and there is a need for standardized reporting. Limitations related to variability in image capture, model building, and data sources should be addressed in the future.


Subject(s)
Surgical Wound Infection , Surgical Wound , Humans , Surgical Wound Infection/diagnosis , Employment , Machine Learning , Physical Examination
4.
JAMA Netw Open ; 6(9): e2335377, 2023 09 05.
Article in English | MEDLINE | ID: mdl-37747733

ABSTRACT

Importance: Artificial intelligence (AI) has gained considerable attention in health care, yet concerns have been raised around appropriate methods and fairness. Current AI reporting guidelines do not provide a means of quantifying overall quality of AI research, limiting their ability to compare models addressing the same clinical question. Objective: To develop a tool (APPRAISE-AI) to evaluate the methodological and reporting quality of AI prediction models for clinical decision support. Design, Setting, and Participants: This quality improvement study evaluated AI studies in the model development, silent, and clinical trial phases using the APPRAISE-AI tool, a quantitative method for evaluating quality of AI studies across 6 domains: clinical relevance, data quality, methodological conduct, robustness of results, reporting quality, and reproducibility. These domains included 24 items with a maximum overall score of 100 points. Points were assigned to each item, with higher points indicating stronger methodological or reporting quality. The tool was applied to a systematic review on machine learning to estimate sepsis that included articles published until September 13, 2019. Data analysis was performed from September to December 2022. Main Outcomes and Measures: The primary outcomes were interrater and intrarater reliability and the correlation between APPRAISE-AI scores and expert scores, 3-year citation rate, number of Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) low risk-of-bias domains, and overall adherence to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement. Results: A total of 28 studies were included. Overall APPRAISE-AI scores ranged from 33 (low quality) to 67 (high quality). Most studies were moderate quality. The 5 lowest scoring items included source of data, sample size calculation, bias assessment, error analysis, and transparency. Overall APPRAISE-AI scores were associated with expert scores (Spearman ρ, 0.82; 95% CI, 0.64-0.91; P < .001), 3-year citation rate (Spearman ρ, 0.69; 95% CI, 0.43-0.85; P < .001), number of QUADAS-2 low risk-of-bias domains (Spearman ρ, 0.56; 95% CI, 0.24-0.77; P = .002), and adherence to the TRIPOD statement (Spearman ρ, 0.87; 95% CI, 0.73-0.94; P < .001). Intraclass correlation coefficient ranges for interrater and intrarater reliability were 0.74 to 1.00 for individual items, 0.81 to 0.99 for individual domains, and 0.91 to 0.98 for overall scores. Conclusions and Relevance: In this quality improvement study, APPRAISE-AI demonstrated strong interrater and intrarater reliability and correlated well with several study quality measures. This tool may provide a quantitative approach for investigators, reviewers, editors, and funding organizations to compare the research quality across AI studies for clinical decision support.


Subject(s)
Artificial Intelligence , Decision Support Systems, Clinical , Humans , Reproducibility of Results , Machine Learning , Clinical Relevance
5.
Can Urol Assoc J ; 17(11): E395-E401, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37549345

ABSTRACT

INTRODUCTION: The use of artificial intelligence (AI) in urology is gaining significant traction. While previous reviews of AI applications in urology exist, there have been few attempts to synthesize existing literature on urothelial cancer (UC). METHODS: Comprehensive searches based on the concepts of "AI" and "urothelial cancer" were conducted in MEDLINE , EMBASE , Web of Science, and Scopus. Study selection and data abstraction were conducted by two independent reviewers. Two independent raters assessed study quality in a random sample of 25 studies with the prediction model risk of bias assessment tool (PROBAST) and the standardized reporting of machine learning applications in urology (STREAM-URO) framework. RESULTS: From a database search of 4581 studies, 227 were included. By area of research, 33% focused on image analysis, 26% on genomics, 16% on radiomics, and 15% on clinicopathology. Thematic content analysis identified qualitative trends in AI models employed and variables for feature extraction. Only 19% of studies compared performance of AI models to non-AI methods. All selected studies demonstrated high risk of bias for analysis and overall concern with Cohen's kappa (k)=0.68. Selected studies met 66% of STREAM-URO items, with k=0.76. CONCLUSIONS: The use of AI in UC is a topic of increasing importance; however, there is a need for improved standardized reporting, as evidenced by the high risk of bias and low methodologic quality identified in the included studies.

6.
Oral Dis ; 2023 Jul 01.
Article in English | MEDLINE | ID: mdl-37392423

ABSTRACT

OBJECTIVES: This systematic review aimed at evaluating the performance of artificial intelligence (AI) models in detecting dental caries on oral photographs. METHODS: Methodological characteristics and performance metrics of clinical studies reporting on deep learning and other machine learning algorithms were assessed. The risk of bias was evaluated using the quality assessment of diagnostic accuracy studies 2 (QUADAS-2) tool. A systematic search was conducted in EMBASE, Medline, and Scopus. RESULTS: Out of 3410 identified records, 19 studies were included with six and seven studies having low risk of biases and applicability concerns for all the domains, respectively. Metrics varied widely and were assessed on multiple levels. F1-scores for classification and detection tasks were 68.3%-94.3% and 42.8%-95.4%, respectively. Irrespective of the task, F1-scores were 68.3%-95.4% for professional cameras, 78.8%-87.6%, for intraoral cameras, and 42.8%-80% for smartphone cameras. Limited studies allowed assessing AI performance for lesions of different severity. CONCLUSION: Automatic detection of dental caries using AI may provide objective verification of clinicians' diagnoses and facilitate patient-clinician communication and teledentistry. Future studies should consider more robust study designs, employ comparable and standardized metrics, and focus on the severity of caries lesions.

7.
JAMIA Open ; 6(3): ooad046, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37425489

ABSTRACT

Background: Standard ontologies are critical for interoperability and multisite analyses of health data. Nevertheless, mapping concepts to ontologies is often done with generic tools and is labor-intensive. Contextualizing candidate concepts within source data is also done in an ad hoc manner. Methods and Results: We present AnnoDash, a flexible dashboard to support annotation of concepts with terms from a given ontology. Text-based similarity is used to identify likely matches, and large language models are used to improve ontology ranking. A convenient interface is provided to visualize observations associated with a concept, supporting the disambiguation of vague concept descriptions. Time-series plots contrast the concept with known clinical measurements. We evaluated the dashboard qualitatively against several ontologies (SNOMED CT, LOINC, etc.) by using MIMIC-IV measurements. The dashboard is web-based and step-by-step instructions for deployment are provided, simplifying usage for nontechnical audiences. The modular code structure enables users to extend upon components, including improving similarity scoring, constructing new plots, or configuring new ontologies. Conclusion: AnnoDash, an improved clinical terminology annotation tool, can facilitate data harmonizing by promoting mapping of clinical data. AnnoDash is freely available at https://github.com/justin13601/AnnoDash (https://doi.org/10.5281/zenodo.8043943).

8.
Int J Nurs Stud ; 145: 104529, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37307638

ABSTRACT

BACKGROUND: Institutions struggle with successful use of sepsis alerts within electronic health records. OBJECTIVE: Test the association of sepsis screening measurement criteria in discrimination of mortality and detection of sepsis in a large dataset. DESIGN: Retrospective, cohort study using a large United States (U.S.) intensive care database. The Institutional Review Board exempt status was obtained from Kansas University Medical Center Human Research Protection Program (10-1-2015). SETTING: 334 U.S. hospitals participating in the eICU Research Institute. PARTICIPANTS: Nine hundred twelve thousand five hundred and nine adult intensive care admissions from 183 hospitals. METHODS: Exposures included: systemic inflammatory response syndrome criteria ≥ 2 (Sepsis-1); systemic inflammatory response syndrome criteria with organ failure criteria ≥ 3.5 points (Sepsis-2); and sepsis-related organ failure assessment score ≥ 2 and quick score ≥ 2 (Sepsis-3). Discrimination of outcomes was determined with/without (adjusted/unadjusted) baseline risk exposure to a model. The receiver operating characteristic curve (AUROC) and odds ratios (ORs) for each decile of baseline risk of sepsis or death were assessed. RESULTS: Within the eligible cohort of 912,509, a total of 86,219 (9.4 %) patients did not survive their hospital stay and 186,870 (20.5 %) met the definition of suspected sepsis. For suspected sepsis discrimination, Sepsis-2 (unadjusted AUROC 0.67, 99 % CI: 0.66-0.67 and adjusted AUROC 0.77, 99 % CI: 0.77-0.77) outperformed Sepsis-3 (SOFA unadjusted AUROC 0.61, 99 % CI: 0.61-0.61 and adjusted AUROC 0.74, 99 % CI: 0.74-0.74) (qSOFA unadjusted AUROC 0.59, 99 % CI: 0.59-0.60 and adjusted AUROC 0.73, 99 % CI: 0.73-0.73). Sepsis-2 also outperformed Sepsis-1 (unadjusted AUROC 0.58, 99 % CI: 0.58-0.58 and adjusted AUROC 0.73, 99 % CI: 0.73-0.73). In between differences of AUROCs were statistically significantly different. Sepsis-2 ORs were higher for the outcome of suspected sepsis when considering deciles of risk than the other measurement systems. CONCLUSIONS AND RELEVANCE: Sepsis-2 outperformed other systems in suspected sepsis detection and was comparable to SOFA in prognostic accuracy of mortality in adult intensive care patients.


Subject(s)
Sepsis , Humans , Adult , Cohort Studies , Retrospective Studies , Hospital Mortality , Sepsis/diagnosis , Intensive Care Units , Prognosis , ROC Curve
9.
Lancet Digit Health ; 5(7): e435-e445, 2023 07.
Article in English | MEDLINE | ID: mdl-37211455

ABSTRACT

BACKGROUND: Accurate prediction of side-specific extraprostatic extension (ssEPE) is essential for performing nerve-sparing surgery to mitigate treatment-related side-effects such as impotence and incontinence in patients with localised prostate cancer. Artificial intelligence (AI) might provide robust and personalised ssEPE predictions to better inform nerve-sparing strategy during radical prostatectomy. We aimed to develop, externally validate, and perform an algorithmic audit of an AI-based Side-specific Extra-Prostatic Extension Risk Assessment tool (SEPERA). METHODS: Each prostatic lobe was treated as an individual case such that each patient contributed two cases to the overall cohort. SEPERA was trained on 1022 cases from a community hospital network (Trillium Health Partners; Mississauga, ON, Canada) between 2010 and 2020. Subsequently, SEPERA was externally validated on 3914 cases across three academic centres: Princess Margaret Cancer Centre (Toronto, ON, Canada) from 2008 to 2020; L'Institut Mutualiste Montsouris (Paris, France) from 2010 to 2020; and Jules Bordet Institute (Brussels, Belgium) from 2015 to 2020. Model performance was characterised by area under the receiver operating characteristic curve (AUROC), area under the precision recall curve (AUPRC), calibration, and net benefit. SEPERA was compared against contemporary nomograms (ie, Sayyid nomogram, Soeterik nomogram [non-MRI and MRI]), as well as a separate logistic regression model using the same variables included in SEPERA. An algorithmic audit was performed to assess model bias and identify common patient characteristics among predictive errors. FINDINGS: Overall, 2468 patients comprising 4936 cases (ie, prostatic lobes) were included in this study. SEPERA was well calibrated and had the best performance across all validation cohorts (pooled AUROC of 0·77 [95% CI 0·75-0·78] and pooled AUPRC of 0·61 [0·58-0·63]). In patients with pathological ssEPE despite benign ipsilateral biopsies, SEPERA correctly predicted ssEPE in 72 (68%) of 106 cases compared with the other models (47 [44%] in the logistic regression model, none in the Sayyid model, 13 [12%] in the Soeterik non-MRI model, and five [5%] in the Soeterik MRI model). SEPERA had higher net benefit than the other models to predict ssEPE, enabling more patients to safely undergo nerve-sparing. In the algorithmic audit, no evidence of model bias was observed, with no significant difference in AUROC when stratified by race, biopsy year, age, biopsy type (systematic only vs systematic and MRI-targeted biopsy), biopsy location (academic vs community), and D'Amico risk group. According to the audit, the most common errors were false positives, particularly for older patients with high-risk disease. No aggressive tumours (ie, grade >2 or high-risk disease) were found among false negatives. INTERPRETATION: We demonstrated the accuracy, safety, and generalisability of using SEPERA to personalise nerve-sparing approaches during radical prostatectomy. FUNDING: None.


Subject(s)
Artificial Intelligence , Prostate , Male , Humans , Retrospective Studies , Prostatectomy , Risk Assessment
11.
Chest ; 164(2): 355-368, 2023 08.
Article in English | MEDLINE | ID: mdl-37040818

ABSTRACT

BACKGROUND: Evidence regarding acute kidney injury associated with concomitant administration of vancomycin and piperacillin-tazobactam is conflicting, particularly in patients in the ICU. RESEARCH QUESTION: Does a difference exist in the association between commonly prescribed empiric antibiotics on ICU admission (vancomycin and piperacillin-tazobactam, vancomycin and cefepime, and vancomycin and meropenem) and acute kidney injury? STUDY DESIGN AND METHODS: This was a retrospective cohort study using data from the eICU Research Institute, which contains records for ICU stays between 2010 and 2015 across 335 hospitals. Patients were enrolled if they received vancomycin and piperacillin-tazobactam, vancomycin and cefepime, or vancomycin and meropenem exclusively. Patients initially admitted to the ED were included. Patients with hospital stay duration of < 1 h, receiving dialysis, or with missing data were excluded. Acute kidney injury was defined as Kidney Disease: Improving Global Outcomes stage 2 or 3 based on serum creatinine component. Propensity score matching was used to match patients in the control (vancomycin and meropenem or vancomycin and cefepime) and treatment (vancomycin and piperacillin-tazobactam) groups, and ORs were calculated. Sensitivity analyses were performed to study the effect of longer courses of combination therapy and patients with renal insufficiency on admission. RESULTS: Thirty-five thousand six hundred fifty-four patients met inclusion criteria (vancomycin and piperacillin-tazobactam, n = 27,459; vancomycin and cefepime, n = 6,371; vancomycin and meropenem, n = 1,824). Vancomycin and piperacillin-tazobactam was associated with a higher risk of acute kidney injury and initiation of dialysis when compared with that of both vancomycin and cefepime (Acute kidney injury: OR, 1.37 [95% CI, 1.25-1.49]; dialysis: OR, 1.28 [95% CI, 1.14-1.45]) and vancomycin and meropenem (Acute kidney injury: OR, 1.27 [95%, 1.06-1.52]; dialysis: OR, 1.56 [95% CI, 1.23-2.00]). The odds of acute kidney injury developing was especially pronounced in patients without renal insufficiency receiving a longer duration of vancomycin and piperacillin-tazobactam therapy compared with vancomycin and meropenem therapy. INTERPRETATION: VPT is associated with a higher risk of acute kidney injury than both vancomycin and cefepime and vancomycin and meropenem in patients in the ICU, especially for patients with normal initial kidney function requiring longer durations of therapy. Clinicians should consider vancomycin and meropenem or vancomycin and cefepime to reduce the risk of nephrotoxicity for patients in the ICU.


Subject(s)
Acute Kidney Injury , Anti-Bacterial Agents , Humans , Anti-Bacterial Agents/therapeutic use , Cefepime/adverse effects , Vancomycin/adverse effects , Retrospective Studies , Meropenem/adverse effects , Critical Illness/therapy , Piperacillin/adverse effects , Drug Therapy, Combination , Piperacillin, Tazobactam Drug Combination/adverse effects , Acute Kidney Injury/chemically induced , Acute Kidney Injury/epidemiology
13.
Sci Data ; 10(1): 1, 2023 01 03.
Article in English | MEDLINE | ID: mdl-36596836

ABSTRACT

Digital data collection during routine clinical practice is now ubiquitous within hospitals. The data contains valuable information on the care of patients and their response to treatments, offering exciting opportunities for research. Typically, data are stored within archival systems that are not intended to support research. These systems are often inaccessible to researchers and structured for optimal storage, rather than interpretability and analysis. Here we present MIMIC-IV, a publicly available database sourced from the electronic health record of the Beth Israel Deaconess Medical Center. Information available includes patient measurements, orders, diagnoses, procedures, treatments, and deidentified free-text clinical notes. MIMIC-IV is intended to support a wide array of research studies and educational material, helping to reduce barriers to conducting clinical research.


Subject(s)
Electronic Health Records , Humans , Databases, Factual , Hospitals
14.
J Am Med Inform Assoc ; 30(4): 718-725, 2023 03 16.
Article in English | MEDLINE | ID: mdl-36688534

ABSTRACT

OBJECTIVE: Convert the Medical Information Mart for Intensive Care (MIMIC)-IV database into Health Level 7 Fast Healthcare Interoperability Resources (FHIR). Additionally, generate and publish an openly available demo of the resources, and create a FHIR Implementation Guide to support and clarify the usage of MIMIC-IV on FHIR. MATERIALS AND METHODS: FHIR profiles and terminology system of MIMIC-IV were modeled from the base FHIR R4 resources. Data and terminology were reorganized from the relational structure into FHIR according to the profiles. Resources generated were validated for conformance with the FHIR profiles. Finally, FHIR resources were published as newline delimited JSON files and the profiles were packaged into an implementation guide. RESULTS: The modeling of MIMIC-IV in FHIR resulted in 25 profiles, 2 extensions, 35 ValueSets, and 34 CodeSystems. An implementation guide encompassing the FHIR modeling can be accessed at mimic.mit.edu/fhir/mimic. The generated demo dataset contained 100 patients and over 915 000 resources. The full dataset contained 315 000 patients covering approximately 5 840 000 resources. The final datasets in NDJSON format are accessible on PhysioNet. DISCUSSION: Our work highlights the challenges and benefits of generating a real-world FHIR store. The challenges arise from terminology mapping and profiling modeling decisions. The benefits come from the extensively validated openly accessible data created as a result of the modeling work. CONCLUSION: The newly created MIMIC-IV on FHIR provides one of the first accessible deidentified critical care FHIR datasets. The extensive real-world data found in MIMIC-IV on FHIR will be invaluable for research and the development of healthcare applications.


Subject(s)
Health Level Seven , Information Dissemination , Information Storage and Retrieval , Patients , Information Storage and Retrieval/methods , Information Storage and Retrieval/standards , Humans , Datasets as Topic , Reproducibility of Results , Electronic Health Records , Information Dissemination/methods
15.
Am J Respir Crit Care Med ; 207(3): 271-282, 2023 02 01.
Article in English | MEDLINE | ID: mdl-36150166

ABSTRACT

Rationale: Invasive ventilation is a significant event for patients with respiratory failure. Physiologic thresholds standardize the use of invasive ventilation in clinical trials, but it is unknown whether thresholds prompt invasive ventilation in clinical practice. Objectives: To measure, in patients with hypoxemic respiratory failure, the probability of invasive ventilation within 3 hours after meeting physiologic thresholds. Methods: We studied patients admitted to intensive care receiving FiO2 of 0.4 or more via nonrebreather mask, noninvasive positive pressure ventilation, or high-flow nasal cannula, using data from the Medical Information Mart for Intensive Care (MIMIC)-IV database (2008-2019) and the Amsterdam University Medical Centers Database (AmsterdamUMCdb) (2003-2016). We evaluated 17 thresholds, including the ratio of arterial to inspired oxygen, the ratio of saturation to inspired oxygen ratio, composite scores, and criteria from randomized trials. We report the probability of invasive ventilation within 3 hours of meeting each threshold and its association with covariates using odds ratios (ORs) and 95% credible intervals (CrIs). Measurements and Main Results: We studied 4,726 patients (3,365 from MIMIC, 1,361 from AmsterdamUMCdb). Invasive ventilation occurred in 28% (1,320). In MIMIC, the highest probability of invasive ventilation within 3 hours of meeting a threshold was 20%, after meeting prespecified neurologic or respiratory criteria while on vasopressors, and 19%, after a ratio of arterial to inspired oxygen of <80 mm Hg. In AmsterdamUMCdb, the highest probability was 34%, after vasopressor initiation, and 25%, after a ratio of saturation to inspired oxygen of <90. The probability after meeting the threshold from randomized trials was 9% (MIMIC) and 13% (AmsterdamUMCdb). In MIMIC, a race/ethnicity of Black (OR, 0.75; 95% CrI, 0.57-0.96) or Asian (OR, 0.6; 95% CrI, 0.35-0.95) compared with White was associated with decreased probability of invasive ventilation after meeting a threshold. Conclusions: The probability of invasive ventilation within 3 hours of meeting physiologic thresholds was low and associated with patient race/ethnicity.


Subject(s)
Noninvasive Ventilation , Respiratory Insufficiency , Humans , Noninvasive Ventilation/adverse effects , Cohort Studies , Intubation, Intratracheal , Hypoxia/complications , Respiratory Insufficiency/etiology , Oxygen , Cannula , Oxygen Inhalation Therapy
16.
JAMIA Open ; 5(4): ooac105, 2022 Dec.
Article in English | MEDLINE | ID: mdl-36570030

ABSTRACT

EHR-based sepsis research often uses heterogeneous definitions of sepsis leading to poor generalizability and difficulty in comparing studies to each other. We have developed OpenSep, an open-source pipeline for sepsis phenotyping according to the Sepsis-3 definition, as well as determination of time of sepsis onset and SOFA scores. The Minimal Sepsis Data Model was developed alongside the pipeline to enable the execution of the pipeline to diverse sources of electronic health record data. The pipeline's accuracy was validated by applying it to the MIMIC-IV version 1.0 data and comparing sepsis onset and SOFA scores to those produced by the pipeline developed by the curators of MIMIC. We demonstrated high reliability between both the sepsis onsets and SOFA scores, however the use of the Minimal Sepsis Data model developed for this work allows our pipeline to be applied to more broadly to data sources beyond MIMIC.

17.
JMIR Med Inform ; 10(11): e40039, 2022 Nov 17.
Article in English | MEDLINE | ID: mdl-36394938

ABSTRACT

BACKGROUND: Given the costs of machine learning implementation, a systematic approach to prioritizing which models to implement into clinical practice may be valuable. OBJECTIVE: The primary objective was to determine the health care attributes respondents at 2 pediatric institutions rate as important when prioritizing machine learning model implementation. The secondary objective was to describe their perspectives on implementation using a qualitative approach. METHODS: In this mixed methods study, we distributed a survey to health system leaders, physicians, and data scientists at 2 pediatric institutions. We asked respondents to rank the following 5 attributes in terms of implementation usefulness: the clinical problem was common, the clinical problem caused substantial morbidity and mortality, risk stratification led to different actions that could reasonably improve patient outcomes, reducing physician workload, and saving money. Important attributes were those ranked as first or second most important. Individual qualitative interviews were conducted with a subsample of respondents. RESULTS: Among 613 eligible respondents, 275 (44.9%) responded. Qualitative interviews were conducted with 17 respondents. The most common important attributes were risk stratification leading to different actions (205/275, 74.5%) and clinical problem causing substantial morbidity or mortality (177/275, 64.4%). The attributes considered least important were reducing physician workload and saving money. Qualitative interviews consistently prioritized implementations that improved patient outcomes. CONCLUSIONS: Respondents prioritized machine learning model implementation where risk stratification would lead to different actions and clinical problems that caused substantial morbidity and mortality. Implementations that improved patient outcomes were prioritized. These results can help provide a framework for machine learning model implementation.

18.
19.
Sci Data ; 9(1): 487, 2022 08 10.
Article in English | MEDLINE | ID: mdl-35948551

ABSTRACT

Chest radiographs allow for the meticulous examination of a patient's chest but demands specialized training for proper interpretation. Automated analysis of medical imaging has become increasingly accessible with the advent of machine learning (ML) algorithms. Large labeled datasets are key elements for training and validation of these ML solutions. In this paper we describe the Brazilian labeled chest x-ray dataset, BRAX: an automatically labeled dataset designed to assist researchers in the validation of ML models. The dataset contains 24,959 chest radiography studies from patients presenting to a large general Brazilian hospital. A total of 40,967 images are available in the BRAX dataset. All images have been verified by trained radiologists and de-identified to protect patient privacy. Fourteen labels were derived from free-text radiology reports written in Brazilian Portuguese using Natural Language Processing.


Subject(s)
Algorithms , Natural Language Processing , Radiography, Thoracic , Brazil , Humans , X-Rays
20.
Respir Care ; 2022 Jul 22.
Article in English | MEDLINE | ID: mdl-35868844

ABSTRACT

PURPOSE: Driving pressure (ΔP) and mechanical power (MP) may be important mediators of lung injury in acute respiratory distress syndrome (ARDS) however there is little evidence for strategies directed at lowering these parameters. We applied predictive modeling to estimate the effects of modifying ventilator parameters on ΔP and MP. METHODS: 2,622 ARDS patients (Berlin criteria) from the Medical Information Mart for Intensive Care IV database (MIMIC-IV version1.0) admitted to the intensive care unit (ICU) at Beth Israel Deaconess Medical Center between 2008 and 2019 were included. Flexible confounding-adjusted regression models for time varying data were fit to estimate the effects of adjusting PEEP and tidal volume (VT) on ΔP, and adjusting VT and respiratory rate (f) on MP. RESULTS: Reduction in VT reduced ΔP and MP, with more pronounced effect on MP with lower compliance. Strategies reducing f, consistently increased MP (when VT was adjusted to maintain consistent minute ventilation). Adjustment of PEEP yielded a U-shaped effect on ΔP. CONCLUSIONS: This novel conditional modeling confirmed expected response patterns for ΔP, with the response to adjustments depending on patients' lung mechanics. Furthermore a VT -driven approach should be favored over a f -driven approach when aiming to reduce MP.

SELECTION OF CITATIONS
SEARCH DETAIL
...